GENOTYPING OF HUMAN CYP3A4
Introduction 

Cytochrome P450 enzymes are a family of heme-containing proteins that play central roles in
the oxidative, peroxidative and reductive metabolism of numerous endogenous and exogenous
compounds, including many pharmaceutical agents. Substances known to be metabolized
by P450 enzymes include steroids, bile acids, fatty acids, prostaglandins, leukotrienes,
biogenic amines, retinoids, lipid hydroperoxides, phytoalexins, pharmaceuticals,
environmental chemicals and pollutants. P450 substrates also include natural plant
products involved in flavor, odor, flower color, and the response to wounding. P450
enzymes and other drug-metabolizing enzymes maintain steady-state levels of endogenous
ligands involved in ligand-modulated transcription of genes affecting growth, apoptosis,
differentiation, cellular homeostasis, and neuroendocrine functions. The metabolism of
foreign chemicals by P450 enzymes can produce toxic metabolites, some of which have
been implicated as agents responsible for birth defects and tumor initiation and progression.

The P450 gene superfamily is likely to have evolved from an ancestral gene present
before the prokaryote/eukaryote divergence. The number of individual P450 genes in any
mammalian species is estimated at 60 to 200. The CYP2C and CYP3A subfamilies are
unique in that they are present in large amounts in human liver microsomes, and there are
many forms in each subfamily. Several human cDNAs encoding CYP3A proteins have been
identified. The most important of these are CYP3A4, CYP3A5 and CYP3A7. CYP3A4 and
CYP3A7 genes are 87% homologous by amino acid and 95% homologous by nucleotide
sequence, while CYP3A4 and CYP3A5 are only 88% homologous in the coding region.
CYP3A4 and CYP3A7 are 91% homologous in the 5'-flanking sequences, differing by the
presence of a unique P450NF specific element (NFSE) and a P450HFLa specific element
(HFLaSE), respectively (Hashimoto et al., 1993).

It has been shown that polymorphisms in the CYP2D6 gene correlate with enzyme
activity measured by phenotyping with dextromethorphan or debrisoquine (Sachse et al.
(1997) Am. J. Hum. Genet. 60:248-295).

The CYP3A subclass catalyzes a remarkable number of oxidation reactions of
clinically important drugs such as quinidine, warfarin, erythromycin, cyclosporin A,
midazolam, lidocaine, nifedipine, and dapsone. Current estimates are that more than 60%
of clinically used drugs are metabolized by the CYP3A4 enzyme, including such major drug
classes as calcium channel blockers, immunosuppressors, macrolide antibiotics and
anticancer drugs; see Brian et al. (1990) Biochemistry 29:11280-11292.

Expression profiles vary significantly among the members of this family. CYP3A4 is
expressed in all adult human liver and intestine, accounting for more than 60% of total P450
in both organs. Expression is inducible in vivo and in vitro by numerous compounds such
as rifampicin, barbiturates and glucocorticoids. In kidney, CYP3A4 is expressed
polymorphically. CYP3A4 expression is sex-influenced, as females have 24% greater
expression than males. CYP3A5 is detected in 10-30% of Caucasian adult livers, and is
expressed constitutively in adult kidney. CYP3A5 expression does not appear to be
sex-influenced and is only moderately inducible by xenobiotics both in vivo and in vitro.
CYP3A7 is expressed in fetal liver but only in 25% of adult livers. The molecular mechanisms
responsible for the developmentally specific expression of the CYP3As are unknown.

Since the rates of metabolism of drugs, toxins, etc. can depend on the amounts and
kinds of P450s expressed in a tissue, variation in biological response may be determined
by the profile of expression of P450s in each person. Analysis of genetic polymorphisms
that lead to altered expression and enzyme activity is therefore of interest.

Summary of the Invention
Genetic sequence polymorphisms are identified in the human CYP3A4 gene.
Nucleic acids comprising the polymorphic sequences are used in screening assays, and for
genotyping individuals. The genotyping information is used to predict the rate of metabolism
for CYP3A4 substrates, and the effect that CYP3A4 modulators will have on such
metabolism. The information allows better prediction of drug interactions, and of the
effective dose for an individual.

Database References for Nucleotide Sequences 
Genbank accession no. S74700 provides the CYP3A5 5' genomic region. Genbank
accession no. D11131 provides a partial sequence of the human cytochrome P-450IIIA4
gene. Genbank accession no. M18907 provides the cDNA sequence of a human
CYP3A4 allele. Sequences of the CYP3A4 gene are provided in the SEQLIST as follows:
cDNA sequence as SEQ ID NO:1, the encoded polypeptide as SEQ ID NO:2, the promoter
region as SEQ ID NO:3, intron 3 as SEQ ID NO:4, intron 4 as SEQ ID NO:5, intron 6 as
SEQ ID NO:6, exon 7, intron 7 as SEQ ID NO:7, intron 10 as SEQ ID NO:8, intron 11 as
SEQ ID NO:9.

Description of the Specific Embodiments
Pharmacogenetics is the linkage between an individual's genotype and that
individual's ability to metabolize or react to a therapeutic agent. Differences in metabolism
or target sensitivity can lead to severe toxicity or therapeutic failure by altering the relation
between bioactive dose and blood concentration of the drug. Relationships between
polymorphisms in metabolic enzymes or drug targets and both response and toxicity can be
used to optimize therapeutic dose administration.

Genetic polymorphisms are identified in the human CYP3A4 gene. Nucleic acids
comprising the polymorphic sequences are used to screen patients for altered metabolism
of CYP3A4 substrates, potential drug-drug interactions, and adverse/side effects, as well
as diseases that result from environmental or occupational exposure to toxins. The nucleic
acids are used to establish animal, cell culture and in vitro cell-free models for drug
metabolism.

Definitions 

It is to be understood that this invention is not limited to the particular methodology,
protocols, cell lines, animal species or genera, constructs, and reagents described, as such
may vary. It is also to be understood that the terminology used herein is for the purpose of
describing particular embodiments only, and is not intended to limit the scope of the present
invention, which will be limited only by the appended claims.

As used herein the singular forms "a", "an", and "the" include plural referents unless
the context clearly dictates otherwise. Thus, for example, reference to "a construct"
includes a plurality of such constructs and reference to "the CYP3A4 nucleic acid"
includes reference to one or more nucleic acids and equivalents thereof known to those
skilled in the art, and so forth. All technical and scientific terms used herein have the
same meaning as commonly understood by one of ordinary skill in the art to which this
invention belongs unless clearly indicated otherwise.

CYP3A4 polymorphic sequences. It has been found that specific sites in the
CYP3A4 gene sequence are polymorphic, i.e. within a population, more than one nucleotide
(G, A, T, C) is found at a specific position. Polymorphisms may provide functional
differences in the genetic sequence, through changes in the encoded polypeptide, changes
in mRNA stability, binding of transcriptional and translation factors to the DNA or RNA, and
the like. The polymorphisms are also used as single nucleotide polymorphisms (SNPs) to
detect genetic linkage to phenotypic variation in activity and expression of the particular
protein.

SNPs are generally biallelic systems, that is, there are two alleles that an individual
may have for any particular marker. SNPs, found approximately every kilobase, offer the
potential for generating very high density genetic maps, which will be extremely useful for
developing haplotyping systems for genes or regions of interest. Because of the nature
of SNPs, they may in fact be the polymorphisms associated with the disease phenotypes
under study. The low mutation rate of SNPs also makes them excellent markers for studying
complex genetic traits.

In order to provide an unambiguous identification of the specific site of a
polymorphism, sequences flanking the polymorphic site are shown in the tables, where the
5' and 3' flanking sequence is non-polymorphic, and the central position, shown in bold, is
variable. It will be understood that there is no special significance to the length of
non-polymorphic flanking sequence that is included, except to aid in positioning the
polymorphism in the genomic sequence.
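
By way of illustration only, locating a variable position by its invariant flanks can be
sketched in Python as follows. The class name FlankedSite and the flanking sequences are
hypothetical placeholders chosen for this sketch; they are not taken from the tables.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FlankedSite:
    """A polymorphic site located by its non-polymorphic flanks (hypothetical helper)."""

    five_prime: str   # invariant sequence immediately 5' of the variable base
    three_prime: str  # invariant sequence immediately 3' of the variable base

    def call(self, sequence: str) -> Optional[str]:
        """Return the base lying between the two flanks, or None if they are not found."""
        seq = sequence.upper()
        start = seq.find(self.five_prime)
        while start != -1:
            pos = start + len(self.five_prime)  # index of the variable base
            if seq[pos + 1: pos + 1 + len(self.three_prime)] == self.three_prime:
                return seq[pos]
            start = seq.find(self.five_prime, start + 1)
        return None


# Made-up flanks for illustration; real flanks would be read from the tables.
site = FlankedSite(five_prime="GACAAGGGCA", three_prime="GAGAGAGGCG")
allele_a = site.call("ttGACAAGGGCAAGAGAGAGGCGtt")  # variable base is A
allele_g = site.call("ttGACAAGGGCAGGAGAGAGGCGtt")  # variable base is G
```

Consistent with the text, the length of the flanks carries no meaning in this sketch beyond
making the match unique within the sequence examined.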

The sequence of at least one allele of human CYP3A4 is known in the art, and
accessible in public databases, as cited above. This sequence is useful as a reference for
the genomic location of the human gene, and for specific coding region sequences. The
subject polymorphic sequences are provided in Table 3, and include the
CYP3A4-A392/CYP3A4-G392 alternative forms, which are associated with differences in
expression level of the polypeptide. As used herein, the term "CYP3A4 gene" is intended to
refer to both the wild-type and variant sequences, unless specifically denoted otherwise.

Nucleic acids of particular interest comprise the provided variant nucleotide
sequence(s). For screening purposes, hybridization probes may be used where both
polymorphic forms are present, either in separate reactions, or labeled such that they can
be distinguished from each other. Assays may utilize nucleic acids that hybridize to one or
more of the described polymorphisms.

The genomic CYP3A4 sequence, including specific transcriptional and translational
regulatory sequences, such as promoters, enhancers, etc., including about 1 kb, but
possibly more, of flanking genomic DNA at the 5' end of the transcribed region, is of
particular interest. The promoter region is useful for determining the pattern of CYP3A4
expression, e.g. induction and inhibition of expression in various tissues, and for providing
promoters that mimic these native patterns of expression. A polymorphic CYP3A4 gene
sequence, i.e. including one or more of the provided polymorphisms, is useful for expression
studies to determine the effect of promoter and/or intron sequence variations on mRNA
expression and stability. The polymorphisms are also used as single nucleotide
polymorphisms to detect genetic linkage to phenotypic variation in activity and expression
of CYP3A4.

As used herein, the term "CYP3A4 gene" is intended to generically refer to both the
wild-type (reference) and variant forms of the sequence, unless specifically denoted
otherwise. As it is commonly used in the art, the term "gene" is intended to refer to the
genomic region encompassing the 5' UTR, exons, introns, and the 3' UTR. Individual
segments may be specifically referred to, e.g. exon 2, intron 5, etc. Combinations of such
segments that provide for a complete protein may be referred to generically as a protein
coding sequence.

The promoter region of CYP3A4 contains a number of sequence motifs for binding
transcription regulatory factors. These include a basic transcription element (SEQ ID NO:1,
nt. 1054-1071); octamer motif (SEQ ID NO:1, nt. 975-982); TATA box (SEQ ID NO:1, nt.
1075-1081); HNF-5 site (SEQ ID NO:1, nt. 913-920); estrogen responsive elements (SEQ
ID NO:1, nt. 735-750, 895-908); CAAT box (SEQ ID NO:1, nt. 843-848); p53 binding site
(SEQ ID NO:1, nt. 721-735); AP-3 binding site (SEQ ID NO:1, nt. 682-693); NFSE site (SEQ
ID NO:1, nt. 810-819); and progesterone/glucocorticoid responsive element (SEQ ID NO:1,
nt. 870-883). Regulatory sequences can be used to identify trans-acting factors that
regulate or mediate CYP3A4 expression.

Fragments of the DNA sequence are obtained by chemically synthesizing
oligonucleotides in accordance with conventional methods, by restriction enzyme digestion,
by PCR amplification, etc. For the most part, DNA fragments will be of at least 15 nt, usually
at least 20 nt, often at least 50 nt. Such small DNA fragments are useful as primers for
PCR, hybridization screening, etc. Larger DNA fragments, i.e. greater than 100 nt, are useful
for production of the encoded polypeptide, promoter motifs, etc. For use in amplification
reactions, such as PCR, a pair of primers will be used. The exact composition of primer
sequences is not critical to the invention, but for most applications the primers will hybridize
to the subject sequence under stringent conditions, as known in the art.
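
As a minimal sketch of how a pair of primers relates to the region it amplifies, the
following Python takes the forward primer from the 5' end of the target and the reverse
primer as the reverse complement of its 3' end. The function names and the default
length of 20 nt are assumptions made for this illustration; melting temperature,
specificity and hybridization stringency are not modeled.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def primer_pair(template: str, start: int, end: int, length: int = 20):
    """Return (forward, reverse) primers bounding template[start:end].

    The forward primer matches the top strand at the 5' end of the target;
    the reverse primer is the reverse complement of the 3' end of the target.
    """
    target = template.upper()[start:end]
    if len(target) < 2 * length:
        raise ValueError("target region is shorter than two primer lengths")
    return target[:length], reverse_complement(target[-length:])


# Made-up template; short primers are used only to keep the example readable.
forward, reverse = primer_pair("ATGCATGCAAGGCCTTAACCGGTT", 0, 24, length=4)
```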

The CYP3A4 nucleic acid sequences are isolated and obtained in substantial purity,
generally as other than an intact mammalian chromosome. Usually, the DNA will be
obtained substantially free of other nucleic acid sequences that do not include a CYP3A4
sequence or fragment thereof, generally being at least about 50%, usually at least about
90% pure, and is typically "recombinant", i.e. flanked by one or more nucleotides with which
it is not normally associated on a naturally occurring chromosome.

CYP3A4 polypeptides. The CYP3A4 genetic sequence, including polymorphisms,
may be employed for synthesis of a complete CYP3A4 protein, or polypeptide fragments
thereof, particularly fragments corresponding to functional domains; binding sites; etc.; and
including fusions of the subject polypeptides to other proteins or parts thereof. For
expression, an expression cassette may be employed, providing for a transcriptional and
translational initiation region, which may be inducible or constitutive, where the coding region
is operably linked under the transcriptional control of the transcriptional initiation region, and
a transcriptional and translational termination region. Various transcriptional initiation
regions may be employed that are functional in the expression host. The polypeptides may
be expressed in prokaryotes or eukaryotes in accordance with conventional ways,
depending upon the purpose for expression. Small peptides can also be synthesized in the
laboratory.



Substrate: a chemical entity that is modified by CYP3A4 oxidation, usually under
normal physiological conditions. Most of these substrates are lipophilic compounds.
Although the duration of drug action tends to be shortened by metabolic transformation,
drug metabolism is not "detoxification". Frequently the metabolic product has greater
biologic activity than the drug itself. In some cases the desirable pharmacologic actions are
entirely attributable to metabolites, the administered drugs themselves being inert. Likewise,
the toxic side effects of some drugs may be due in whole or in part to metabolic products.
The range of known substrates for CYP3A4 is very broad, including steroids, e.g.
testosterone, estradiol, mifepristone; tricyclic antidepressants, e.g. amitriptyline,
clomipramine, imipramine, desipramine; SSRIs, e.g. citalopram, fluoxetine, fluvoxamine,
paroxetine and sertraline; bile acids; protease inhibitors, e.g. saquinavir, indinavir; fatty acids;
prostaglandins; leukotrienes; biogenic amines; retinoids; lipid hydroperoxides; phytoalexins;
antibiotics, e.g. erythromycin; taxanes, e.g. paclitaxel, docetaxel; anti-hypertensives, e.g.
diltiazem; environmental chemicals and pollutants; felodipine, rifabutin, haloperidol,
triazolam, terfenadine, lovastatin, chlorzoxazone, alprazolam, etc.

Modifier. A chemical agent that modulates the action of CYP3A4, either through
altering its enzymatic activity (enzymatic modifier) or through modulation of expression
(expression modifier). In some cases the modifier may also be a substrate, thereby inducing
its own demise. Selective inhibitors of CYP3A4 include ketoconazole and troleandomycin.
Other P450 selective inhibitors include venlafaxine, clarithromycin, fluconazole, itraconazole,
ritonavir, orphenadrine, methimazole, midazolam, gestodene, etc. Recent studies have
shown that orally administered grapefruit juice is an expression modifier of CYP3A4, acting
to specifically down-regulate expression in enterocytes (Lown et al. (1997) J. Clin. Invest.
99:2545-2553).

Recent studies (Schuetz and Schuetz (1996) Mol. Pharmacol. 49:311-318) on
expression of P-glycoprotein and CYP3A4 showed that both proteins were up-regulated
after treatment with many drugs, including rifampicin, phenobarbital, clotrimazole, reserpine,
and isosafrole. P-glycoprotein was up-regulated by midazolam and nifedipine, whereas
CYP3A4 was not. Azoles appear to be broad spectrum inhibitors of cytochromes P450.

Pharmacokinetic parameters. Pharmacokinetic parameters provide fundamental
data for designing safe and effective dosage regimens. A drug's volume of distribution,
clearance, and the derived parameter, half-life, are particularly important, as they determine
the degree of fluctuation between the maximum and minimum plasma concentration during
a dosage interval, the magnitude of the steady state concentration and the time to reach
steady state plasma concentration upon chronic dosing. The pharmacokinetics of drugs often
vary considerably between individuals, largely because of variations in the expression of CYP
enzymes in the liver and other tissues. Parameters derived from in vivo drug administration
are useful in determining the clinical effect of a particular CYP3A4 genotype.
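
The dependence of these quantities on volume of distribution and clearance can be made
concrete with a short Python sketch. It assumes a one-compartment model with first-order
elimination and repeated intravenous bolus dosing, which is a simplification introduced
here and not a model stated in this description; the function names and numerical
values are likewise illustrative only.

```python
import math


def half_life(volume_l: float, clearance_l_per_h: float) -> float:
    """Elimination half-life in hours: t1/2 = ln(2) * Vd / CL."""
    return math.log(2) * volume_l / clearance_l_per_h


def average_steady_state(dose_mg: float, interval_h: float,
                         clearance_l_per_h: float,
                         bioavailability: float = 1.0) -> float:
    """Average steady state concentration in mg/L: F * dose / (CL * tau)."""
    return bioavailability * dose_mg / (clearance_l_per_h * interval_h)


def peak_to_trough_ratio(volume_l: float, clearance_l_per_h: float,
                         interval_h: float) -> float:
    """Cmax/Cmin over one dosing interval at steady state: exp(k * tau), k = CL / Vd."""
    k = clearance_l_per_h / volume_l
    return math.exp(k * interval_h)


def fraction_of_steady_state(elapsed_h: float, volume_l: float,
                             clearance_l_per_h: float) -> float:
    """Fraction of the steady state level reached after elapsed_h: 1 - exp(-k * t)."""
    k = clearance_l_per_h / volume_l
    return 1.0 - math.exp(-k * elapsed_h)


# Two hypothetical individuals who differ only in clearance (values are made up).
for label, clearance in (("higher clearance", 20.0), ("lower clearance", 10.0)):
    print(label,
          round(half_life(100.0, clearance), 2), "h half-life;",
          round(average_steady_state(10.0, 12.0, clearance), 4), "mg/L average")
```

Under these assumptions, halving clearance doubles both the half-life and the average
steady state concentration for the same regimen, which is the kind of inter-individual
difference that genotype-guided dosing is intended to anticipate.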

Expression assay. An assay to determine the effect of a sequence polymorphism
on CYP3A4 expression. Expression assays may be performed in cell-free extracts, or by
transforming cells with a suitable vector. Alterations in expression may occur in the basal
level that is expressed in one or more cell types, or in the effect that an expression modifier
has on the ability of the gene to be inhibited or induced. Expression levels of variant
alleles are compared by various methods known in the art. Methods for determining
promoter or enhancer strength include quantitation of the expressed natural protein;
insertion of the variant control element into a vector with a reporter gene such as
β-galactosidase, luciferase, chloramphenicol acetyltransferase, etc. that provides for
convenient quantitation; and the like. Specific constructs for determining promoter strength
of CYP3A4 are described in Hashimoto et al. (1993) Eur. J. Biochem. 218:585-595.

Gel shift or electrophoretic mobility shift assay provides a simple and rapid method
for detecting DNA-binding proteins (Ausubel, F.M. et al. (1989) In: Current Protocols in
Molecular Biology, Vol. 2, John Wiley and Sons, New York). This method has been used
widely in the study of sequence-specific DNA-binding proteins, such as transcription factors.
The assay is based on the observation that complexes of protein and DNA migrate through
a nondenaturing polyacrylamide gel more slowly than free DNA fragments or
double-stranded oligonucleotides. The gel shift assay is performed by incubating a purified
protein, or a complex mixture of proteins (such as nuclear or cell extract preparations), with
an end-labeled DNA fragment containing the putative protein binding site. The reaction
products are then analyzed on a nondenaturing polyacrylamide gel. The specificity of the
DNA-binding protein for the putative binding site is established by competition experiments
using DNA fragments or oligonucleotides containing a binding site for the protein of interest,
or other unrelated DNA sequences.

CYP3A4 is known to be expressed in liver, e.g. embryonic liver, mature hepatocytes;
duodenal tissue, e.g. mucosal epithelial cells; and other epithelial cells throughout the
digestive tract; breast tissue; placental tissue; lung tissue, e.g. bronchial glands, bronchiolar
columnar and terminal epithelium, type II alveolar epithelium and alveolar macrophages, etc.
Hepatic levels of CYP3A4 can be estimated by an erythromycin breath test, and vary by at
least 10-fold among patients.

Substrate screening assay. Assays to determine the metabolic activity of a CYP3A4
protein or peptide fragment on a substrate. Many suitable assays are known in the art,
including the use of primary or cultured cells, e.g. epithelial cells from liver, intestine, etc.,
genetically modified cells where the native CYP3A4 alleles are altered or inactivated,
cell-free systems, e.g. microsomal preparations or recombinantly produced enzymes in a
suitable buffer, or in animals, including human clinical trials. Clinical trials may monitor
serum, urine, etc. levels of the substrate or its metabolite(s).

Typically a candidate substrate is introduced into the assay system, and the oxidation to
a metabolite is measured over time. The choice of detection system is determined by the
substrate and the specific assay parameters. Assays are run conventionally, and will
include negative and positive controls, varying concentrations of substrate and enzyme, etc.



Genotyping: CYP3A4 genotyping is performed by DNA or RNA sequence and/or
hybridization analysis of any convenient sample from a patient, e.g. biopsy material, blood
sample, scrapings from cheek, etc. A nucleic acid sample from an individual is analyzed for
the presence of polymorphisms in CYP3A4, particularly those that affect the activity or
expression of CYP3A4. Specific sequences of interest include any polymorphism that leads
to changes in basal expression in one or more tissues, to changes in the modulation of
CYP3A4 expression by modifiers, or to alterations in CYP3A4 substrate specificity and/or
activity.

Linkage Analysis: Diagnostic screening may be performed for polymorphisms that
are genetically linked to a phenotypic variant in CYP3A4 activity or expression, particularly
through the use of microsatellite markers or single nucleotide polymorphisms (SNPs). The
microsatellite or SNP polymorphism itself may not be phenotypically expressed, but is linked
to sequences that result in altered activity or expression. Two polymorphic variants may be
in linkage disequilibrium, i.e. where alleles show non-random associations between genes
even though individual loci are in Hardy-Weinberg equilibrium.
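
The two population-genetic notions used above can be illustrated with a brief Python
sketch. It uses the standard textbook definitions, namely Hardy-Weinberg genotype
frequencies p², 2pq and q² at a single biallelic locus, and the disequilibrium
coefficient D = p(AB) - p(A)p(B) with its normalized form r² for two loci. The function
names, allele labels and frequencies are hypothetical and serve only as an example.

```python
def hardy_weinberg_expected(p: float) -> dict:
    """Expected genotype frequencies at a biallelic locus with allele frequency p."""
    q = 1.0 - p
    return {"A/A": p * p, "A/G": 2.0 * p * q, "G/G": q * q}


def linkage_disequilibrium(p_ab: float, p_a: float, p_b: float) -> tuple:
    """Return (D, r_squared) for two biallelic loci.

    p_ab is the frequency of the haplotype carrying allele A at the first locus
    and allele B at the second; p_a and p_b are the individual allele frequencies.
    """
    d = p_ab - p_a * p_b
    r_squared = d * d / (p_a * (1.0 - p_a) * p_b * (1.0 - p_b))
    return d, r_squared


# Made-up frequencies: random association versus complete association.
independent = linkage_disequilibrium(p_ab=0.25, p_a=0.5, p_b=0.5)
complete = linkage_disequilibrium(p_ab=0.50, p_a=0.5, p_b=0.5)
expected = hardy_weinberg_expected(0.9)
```

A marker with r² close to 1 relative to a functional variant is nearly as informative as
typing the functional variant itself, whereas a value near 0 means the marker predicts
little about it.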

Linkage analysis may be performed alone, or in combination with direct detection of
phenotypically evident polymorphisms. The use of microsatellite markers for genotyping is
well documented. For examples, see Mansfield et al. (1994) Genomics 24:225-233; and
Ziegle et al. (1992) Genomics 14:1026-1031. The use of SNPs for genotyping is illustrated
in Golevleva et al. (1996) Am. J. Hum. Genet. 59:570-578; and in Underhill et al. (1996)
P.N.A.S. 93:196-200.

Transgenic animal. The subject nucleic acids can be used to generate genetically
modified non-human animals or site specific gene modifications in cell lines. The term
"transgenic" is intended to encompass genetically modified animals having a deletion or
other knock-out of CYP3A4 gene activity, having an exogenous CYP3A4 gene that is stably
transmitted in the host cells, or having an exogenous CYP3A4 promoter operably linked to
a reporter gene. Transgenic animals may be made through homologous recombination,
where the CYP3A4 locus is altered. Alternatively, a nucleic acid construct is randomly
integrated into the genome. Vectors for stable integration include plasmids, retroviruses and
other animal viruses, YACs, and the like. Of interest are transgenic mammals, e.g. cows,
pigs, goats, horses, etc., and particularly rodents, e.g. rats, mice, etc.

Genetically Modified Cells. Primary or cloned cells and cell lines are modified by the
introduction of vectors comprising CYP3A4 gene polymorphisms. The gene may comprise
one or more variant sequences, preferably a haplotype of commonly occurring
combinations. U.S. 5,429,948, July 4, 1995, describes the construction and use of a cell
line that expresses multiple P450 enzymes.

Vectors useful for introduction of the gene include plasmids and viral vectors, e.g.
retroviral-based vectors, adenovirus vectors, etc. that are maintained transiently or stably
in mammalian cells. A wide variety of vectors can be employed for transfection and/or
integration of the gene into the genome of the cells. Alternatively, micro-injection, fusion,
or the like may be employed for introduction of genes into a suitable host cell.

The expression vector will have a transcriptional initiation region oriented to produce
functional mRNA, preferably the native transcriptional initiation region, e.g. including the
polymorphism described in Table 3. Generally the vectors will include markers for selection,
and may also comprise detectable markers operably linked to the CYP3A4 promoter,
transcription cassettes for internal controls, etc.

Cell-free assay systems. A number of cell-free assays have been described that are
useful in the subject invention. Yamazaki et al. (1997) Arch. Biochem. Biophys. 342:329-337
demonstrates reconstituted systems with recombinantly produced CYP3A4. U.S. 5,413,915
describes a microsomal P-450 oxidase enzyme complex dispersed in a thin film of a generally
neutral hydrophilic film-forming binder. Substrates are converted into metabolic
intermediates that can be detected by a colorimetric indicator that is present in the binder
film or an adjacent binder film and undergoes a visible color change. U.S. 5,478,723 discloses
methods for determining the enzyme or enzymes in the human body that metabolize a
particular drug by comparing microsomal fractions from different donors.

Genotyping Methods
The effect of a polymorphism in CYP3A4 gene sequence on the response to a
particular substrate or modifier of CYP3A4 is determined by in vitro or in vivo assays. Such
assays may include monitoring the metabolism of a substrate during clinical trials to
determine the CYP3A4 enzymatic activity, specificity or expression level. Generally, in vitro
assays are useful in determining the direct effect of a particular polymorphism, while clinical
studies will also detect an enzyme phenotype that is genetically linked to a polymorphism.
The response of an individual to the substrate or modifier can then be predicted by
determining the CYP3A4 genotype, with respect to the polymorphism. Where there is a
differential distribution of a polymorphism by racial background, guidelines for drug
administration can be generally tailored to a particular ethnic group.

The polymorphisms in the sequence of CYP3A4 provided in Table 3, particularly the
A to G substitution at -392, are screened for the effect of the polymorphism on expression.
Several effects are of interest, including basal expression levels in different tissues,
alterations in enzyme activity or specificity, and the induction or inhibition of expression by
modifiers. The latter is of particular interest in determining drug-drug interactions. In
particular, pharmacokinetic drug interactions with antimicrobials are common because of the
tendency to prescribe them in combination with other therapies.

Tissue specific differences in expression are of interest because the metabolism of
drugs can vary with the route of administration. For example, certain orally administered
drugs are affected by the CYP3A4 expression level in enterocytes, while the same drug
administered intravenously is only affected by hepatic expression levels of CYP3A4.

The basal expression level in different tissues may be determined by analysis of tissue
samples from individuals typed for the presence or absence of a specific polymorphism. For
example, the CYP3A4 mRNA or protein level in hepatocytes, gastrointestinal epithelial cells,
etc. is determined. Any convenient method may be used, e.g. ELISA, RIA, etc. for protein
quantitation; northern blot or other hybridization analysis, quantitative RT-PCR, etc. for
mRNA quantitation. The tissue specific expression is correlated with the genotype.
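
One simple way to relate measured expression to genotype is to group samples by
genotype and compare the group means, as sketched below in Python. The genotype
labels, the relative expression values and the function name are invented for this
illustration and do not represent measured data; a real analysis would also apply an
appropriate statistical test.

```python
from collections import defaultdict
from statistics import mean


def mean_expression_by_genotype(samples):
    """Group (genotype, expression_level) pairs and return the mean level per genotype."""
    groups = defaultdict(list)
    for genotype, level in samples:
        groups[genotype].append(level)
    return {genotype: mean(levels) for genotype, levels in groups.items()}


# Invented relative expression levels for one tissue, keyed by genotype at one site.
liver_samples = [
    ("A/A", 1.0), ("A/A", 1.2),
    ("A/G", 1.6), ("A/G", 1.8),
    ("G/G", 2.4),
]
liver_means = mean_expression_by_genotype(liver_samples)
```

Repeating the same grouping for samples from a second tissue, e.g. intestinal epithelium,
would show whether an association between genotype and expression is tissue specific.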

Alternatively, basal expression levels are determined by expression assays for the
particular promoter sequence, as previously described. The assays may be performed with
the CYP3A4 coding sequence or with a detectable marker sequence. To determine tissue
specificity the assay is performed in cells derived from different sources.

The alteration of CYP3A4 expression in response to a modifier is determined by
administering or combining the candidate modifier with an expression system, e.g. animal,
cell, in vitro transcription assay, etc. The effect of the modifier on CYP3A4 transcription
and/or steady state mRNA levels is determined. As with the basal expression levels, tissue
specific interactions are of interest. Correlations are made between the ability of an
expression modifier to affect CYP3A4 activity, and the presence of the provided
polymorphisms. A panel of different modifiers, cell types, etc. may be screened in order to
determine the effect under a number of different conditions.

A CYP3A4 polymorphism that results in altered enzyme activity or specificity is
determined by a variety of assays known in the art. The enzyme may be tested for
metabolism of a substrate in vitro, for example in defined buffer, or in cell or subcellular
lysates, where the ability of a substrate to be oxidized by CYP3A4 under physiologic
conditions is determined. Where there are not significant issues of toxicity from the
substrate or metabolite(s), in vivo human trials may be utilized, as previously described.

The genotype of an individual is determined with respect to the provided CYP3A4
gene polymorphisms. The genotype is useful for determining the presence of a
phenotypically evident polymorphism, and for determining the linkage of a polymorphism to
phenotypic change.

A number of methods are available for analyzing nucleic acids for the presence of
a specific sequence. Where large amounts of DNA are available, genomic DNA is used
directly. Alternatively, the region of interest is cloned into a suitable vector and grown in
sufficient quantity for analysis. The nucleic acid may be amplified by conventional
techniques, such as the polymerase chain reaction (PCR), to provide sufficient amounts for
analysis. The use of the polymerase chain reaction is described in Saiki et al. (1985)
Science 239:487, and a review of current techniques may be found in Sambrook et al.
Molecular Cloning: A Laboratory Manual, CSH Press 1989, pp. 14.2-14.33. Amplification
may be used to determine whether a polymorphism is present, by using a primer that is
specific for the polymorphism. Alternatively, various methods are known in the art that utilize
oligonucleotide ligation as a means of detecting polymorphisms; for examples see Riley et
al. (1990) N.A.R. 18:2887-2890; and Delahunty et al. (1996) Am. J. Hum.
Genet. 58:1239-1246.

A detectable label may be included in an amplification reaction. Suitable labels
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red,
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM),
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX),
6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or
N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); radioactive labels, e.g. 32P, 35S, 3H; etc.
The label may be a two stage system, where the amplified DNA is conjugated to biotin,
haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc.,
where the binding partner is conjugated to a detectable label. The label may be conjugated
to one or both of the primers. Alternatively, the pool of nucleotides used in the amplification
is labeled, so as to incorporate the label into the amplification product.

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or
other methods. Hybridization with the variant sequence may also be used to determine its
presence, by Southern blots, dot blots, etc. The hybridization pattern of a control and
variant sequence to an array of oligonucleotide probes immobilised on a solid support, as
described in U.S. 5,445,934, or in WO95/35505, may also be used as a means of detecting
the presence of variant sequences. Single strand conformational polymorphism (SSCP)
analysis, denaturing gradient gel electrophoresis (DGGE), mismatch cleavage detection,
and heteroduplex analysis in gel matrices are used to detect conformational changes
created by DNA sequence variation as alterations in electrophoretic mobility. Alternatively,
where a polymorphism creates or destroys a recognition site for a restriction endonuclease
(restriction fragment length polymorphism, RFLP), the sample is digested with that
endonuclease, and the products size fractionated to determine whether the fragment was
digested. Fractionation is performed by gel or capillary electrophoresis, particularly
acrylamide or agarose gels.
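
The RFLP approach described above can be illustrated with a minimal sketch that checks whether a single-base change creates or destroys a restriction site. The enzyme panel below is a small illustrative selection of well-known palindromic recognition sequences and is an assumption of this example, not a panel named in the specification; the function names are likewise hypothetical. The two alleles are the intron 6 sequences listed in Table 3 (SEQ ID NO:49 and SEQ ID NO:50).

```python
# Recognition sequences for a few common restriction endonucleases.
# All are palindromic, so scanning one strand is sufficient.
# This panel is an illustrative assumption, not part of the specification.
SITES = {
    "EcoRI": "GAATTC",
    "PvuII": "CAGCTG",
    "AluI": "AGCT",
    "HaeIII": "GGCC",
    "TaqI": "TCGA",
    "MspI": "CCGG",
}


def enzymes_cutting(seq):
    """Return the set of enzymes whose recognition site occurs in seq."""
    seq = seq.upper()
    return {name for name, site in SITES.items() if site in seq}


def rflp_candidates(reference, variant):
    """Return (lost, gained): sites present in only one of the two alleles."""
    ref_cut = enzymes_cutting(reference)
    var_cut = enzymes_cutting(variant)
    return ref_cut - var_cut, var_cut - ref_cut


# Intron 6 alleles from Table 3 (SEQ ID NO:49 and SEQ ID NO:50).
lost, gained = rflp_candidates("CCCTCCAGCTGCCTGCCAT", "CCCTCCAGCGGCCTGCCAT")
print("sites lost in variant:", sorted(lost))
print("sites gained in variant:", sorted(gained))
```

Within this short window, an enzyme listed as lost or gained would be expected to digest only one of the two alleles. A real assay would also need to confirm that the enzyme has no other sites in the amplified fragment that would obscure the size difference.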

In one embodiment of the invention, an array of oligonucleotides is provided, where
discrete positions on the array are complementary to one or more of the provided
polymorphic sequences, e.g. oligonucleotides of at least 12 nt, frequently 20 nt, or larger,
and including the sequence flanking the polymorphic position. Such an array may comprise
a series of oligonucleotides, each of which can specifically hybridize to a different
polymorphism. For examples of arrays, see Hacia et al. (1996) Nature Genetics
14:441-447; Lockhart et al. (1996) Nature Biotechnol. 14:1675-1680; and De Risi et al.
(1996) Nature Genetics 14:457-460.

The genotype information is used to predict the response of the individual to a
particular CYP3A4 substrate or modifier. Where an expression modifier, e.g. a macrolide
drug, inhibits CYP3A4 expression, then drugs that are a CYP3A4 substrate will be
metabolized more slowly if the modifier is co-administered. Where an expression modifier
induces CYP3A4 expression, a coadministered substrate will typically be metabolized more
rapidly. Similarly, changes in CYP3A4 activity will affect the metabolism of an administered
drug. The pharmacokinetic effect of the interaction will depend on the metabolite that is
produced, e.g. a prodrug is metabolized to an active form, a drug is metabolized to an
inactive form, an environmental compound is metabolized to a toxin, etc. Consideration is
given to the route of administration, drug-drug interactions, drug dosage, etc.

The CYP3A4-A392/CYP3A4-G392 alternative forms are shown to be differentially
distributed between broadly defined racial groups. The G form is more prevalent in African
Americans, while the A form is more prevalent in American Caucasians and American
Hispanics. The administration of CYP3A4 substrates and expression modifiers may be
adjusted to reflect racial differences in metabolism.



Experimental 

The following examples are put forth so as to provide those of ordinary skill in the art
with a complete disclosure and description of how to make and use the subject invention,
and are not intended to limit the scope of what is regarded as the invention. Efforts have
been made to ensure accuracy with respect to the numbers used (e.g. amounts,
temperature, concentrations, etc.) but some experimental errors and deviations should be
allowed for. Unless otherwise indicated, parts are parts by weight, molecular weight is
average molecular weight, temperature is in degrees centigrade, and pressure is at or near
atmospheric.

MATERIALS AND METHODS 

DNA samples. Blood specimens from approximately 300 individuals were collected
after obtaining informed consent. All samples were stripped of personal identifiers to
maintain confidentiality. The only data associated with a given blood sample was gender
and self-reported major racial group designations in the United States (Caucasian, Hispanic,
African American). Genomic DNA was isolated from these samples using standard
techniques. gDNA was either stored as concentrated solutions or stored dried in microtiter
plates for future use.

PCR amplifications. The primers used to amplify exons 5, 6, 7, 10, 12, and the
promoter region of the CYP3A4 gene from 200 ng of human gDNA are shown in Table 1.
Primers were designed based upon publicly available cDNA and intron/exon boundary
sequence, as well as intron sequences determined in our laboratory. 100 ng of gDNA from
2 individuals was amplified with the Perkin Elmer GeneAmp PCR kit according to
manufacturer's instructions in 100 µl reactions with Taq Gold DNA polymerase, with one
exception. Boehringer-Mannheim Expand High Fidelity PCR System kit was used to amplify
intron 3. Magnesium concentrations for each PCR reaction were optimized empirically, and
are shown in Table 1. Thermal cycling was performed in a GeneAmp PCR System 9600
PCR machine (Perkin Elmer) with an initial denaturation step at 95°C for 10 min, followed
by 35 cycles of denaturation at 95°C for 30 sec, primer annealing at 55°C for 45 sec, and
primer extension at 72°C for 2 min, followed by final extension at 72°C for 5 min, with the
following exceptions. Annealing temperature for the promoter fragment was 58°C. Cycling
conditions for intron 3 were an initial denaturation at 95°C for 2 min, followed by 35 cycles
of denaturation at 94°C for 30 sec, primer annealing at 55°C for 45 sec, and primer
extension at 68°C for 6 min, followed by a final extension at 68°C for 7 min.
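
The two cycling programs above can be written down as data and their programmed hold times added up, which is a quick sanity check when transcribing a protocol into a thermal cycler. This is a minimal sketch: the function name and data layout are hypothetical, and the totals count only the programmed holds, ignoring ramp times between temperatures.

```python
def program_seconds(initial_s, cycles, step_seconds, final_s):
    """Total programmed hold time: initial hold + cycles * (sum of steps) + final hold."""
    return initial_s + cycles * sum(step_seconds) + final_s


# Standard program: 95C 10 min; 35 x (95C 30 s, 55C 45 s, 72C 2 min); 72C 5 min.
# (The promoter fragment uses 58C annealing; the durations are unchanged.)
standard = program_seconds(10 * 60, 35, [30, 45, 2 * 60], 5 * 60)

# Intron 3 program: 95C 2 min; 35 x (94C 30 s, 55C 45 s, 68C 6 min); 68C 7 min.
intron3 = program_seconds(2 * 60, 35, [30, 45, 6 * 60], 7 * 60)

for name, seconds in (("standard", standard), ("intron 3", intron3)):
    print(f"{name}: {seconds} s ({seconds / 60:.2f} min)")
```

The longer extension step used for intron 3 roughly doubles the programmed run time relative to the standard program.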

DNA sequencing. PCR products from 32 individuals, approximately 1/3 representing
each of the 3 major racial groups (see above), were spin column purified using
Microcon-100 columns. Cycle sequencing was performed on the GeneAmp PCR System
9600 PCR machine (Perkin Elmer) using the ABI Prism dRhodamine Terminator Cycle
Sequencing Ready Reaction Kit according to the manufacturer's directions. Oligonucleotide
primers used for the sequencing reactions are listed in Table 2. 8 µl sequencing reactions
were subjected to 30 cycles at 96°C for 20 sec, 50°C for 20 sec, and 60°C for 4 min,
followed by ethanol precipitation. Samples were evaporated to dryness at 50°C for ~15 min
and resuspended in 2 µl of loading buffer (5:1 deionized formamide : 50 mM EDTA pH 8.0),
heated to 65°C for 5 min, and electrophoresed through 4% polyacrylamide/6M urea gels in
an ABI 377 Nucleic Acid Analyzer according to the manufacturer's instructions for sequence
determination. All sequences were determined from both the 5' and 3' (sense and antisense)
directions.

Each sequencing reaction was performed with 2 individuals' DNA pooled together.
The 16 electropherograms were analyzed by comparing peak heights, looking for ~25%
reduction in peak size and/or presence of extra peaks as an indication of heterozygosity.
Each electropherogram result that suggested the presence of a polymorphism was
confirmed by individually resequencing each of the individuals belonging to that pool on
both strands.
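
The ~25% threshold follows from simple allele counting: a pool of 2 diploid individuals contains 4 chromosomes, so a single heterozygous individual contributes the variant base on 1 of 4 templates. The sketch below works this through under two stated assumptions that are not claims of the specification: both individuals contribute equal amounts of DNA, and peak height is proportional to the fraction of templates carrying a base. The function name is hypothetical.

```python
from fractions import Fraction


def reference_peak_fraction(n_individuals, variant_chromosomes):
    """Fraction of the reference peak remaining in a pool of diploid individuals.

    Assumes equal DNA input per individual and peak height proportional
    to the fraction of templates carrying the reference base.
    """
    total = 2 * n_individuals  # two chromosomes per diploid individual
    if not 0 <= variant_chromosomes <= total:
        raise ValueError("variant_chromosomes must be between 0 and 2 * n_individuals")
    return Fraction(total - variant_chromosomes, total)


# One heterozygote in a pool of 2: the reference peak drops by one quarter.
remaining = reference_peak_fraction(2, 1)
print("remaining:", remaining, "reduction:", 1 - remaining)
```

Under the same assumptions, two variant chromosomes in the pool (two heterozygotes or one homozygote) would halve the reference peak, which is why each suspect pool is resolved by resequencing its members individually.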

Population genotyping. High-throughput genotyping using TaqMan technology (ABI)
was performed using standard techniques (Livak et al. (1995) PCR Methods and
Applications 4:357-362) on the samples described above. The promoter region from -422
to -331 was amplified using oligonucleotide primers CYP3A4_Promo1A (SEQ ID NO:10)
(5'-TGGCTTGTTGGGATGAATTTCAAG-3') and CYP3A4_Promo1B (SEQ ID NO:11)
(5'-TTACTGGGGAGTCCAAGGGTTCTG-3') at a concentration of 1.0 mM in 25 µl reactions
containing 7.5 mM MgCl2. CYP3A4_APromo1, Fam-labeled (SEQ ID NO:12)
(5'-TTAAATCGCCTCTCTCTTGCCCTTGTCTCTAT-3') and CYP3A4_GPromo1, Tet-labeled
(SEQ ID NO:13) (5'-AATCGCCTCTCTCCTGCCCTTGTCTCTAT-3') oligonucleotide probes
at a concentration of 100 nM were incorporated into the reactions for polymorphism
detection. Thermal cycling was performed in a GeneAmp PCR System 9600 PCR machine
(Perkin Elmer) with an initial incubation at 50°C for 2 min, followed by an initial denaturation
step at 95°C for 10 min, followed by 45 cycles of denaturation at 95°C for 30 sec and primer
annealing/extension at 66°C for 1 min. Results were automatically read on an LS50B
(Perkin-Elmer).
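
The two TaqMan probes differ at a single base, and both are written on the strand complementary to the allele sequences listed in Table 3. The following minimal sketch makes both points explicit by aligning the probes at their 3' ends and reverse-complementing them. The helper names are hypothetical; the sequences are copied from the text above and from Table 3 (SEQ ID NO:43 and SEQ ID NO:44).

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def mismatches_from_3prime(a, b):
    """Align two probes at their 3' ends and list (index, base_a, base_b) differences.

    The index is 0-based within the shared 3' portion of the two probes.
    """
    n = min(len(a), len(b))
    tail_a, tail_b = a[-n:], b[-n:]
    return [(i, x, y) for i, (x, y) in enumerate(zip(tail_a, tail_b)) if x != y]


A_PROBE = "TTAAATCGCCTCTCTCTTGCCCTTGTCTCTAT"  # CYP3A4_APromo1, Fam-labeled
G_PROBE = "AATCGCCTCTCTCCTGCCCTTGTCTCTAT"     # CYP3A4_GPromo1, Tet-labeled

diffs = mismatches_from_3prime(A_PROBE, G_PROBE)
print("probe differences:", diffs)

# The probes are antisense: their reverse complements contain the Table 3 alleles.
print("ACAAGGGCAAGAGAGAGGC" in reverse_complement(A_PROBE))
print("ACAAGGGCAGGAGAGAGGC" in reverse_complement(G_PROBE))
```

The single T/C difference between the probes corresponds to the A/G difference on the opposite strand, which is the -392 polymorphism.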

RESULTS 

A 664 bp fragment of the human CYP3A4 gene, which included 470 bp of the
promoter region, 174 bp of exon 1, and 20 bp of intron 1, was amplified and
resequenced. An adenine (A) to guanine (G) transition was identified at position -392 (from
the start codon) which occurred at a frequency of approximately 30% in the racially-mixed
64 chromosomes screened by resequencing. Subsequent genotyping of 95 individuals from
each of 3 broadly defined racial groups (African Americans, Hispanic Americans, and
Caucasian Americans) produced the following allele frequencies:

Group                  CYP3A4-A392    CYP3A4-G392
American Caucasians    .963           .037
American Hispanics     .931           .069
African Americans      .473           .527

A cytosine (C) to thymine (T) change was identified at position +52 of intron 6 which
occurred at a frequency of approximately 1% in the racially-mixed 64 chromosomes
screened by resequencing. A T to G change was identified at position +34 of intron 7 which
occurred at a frequency of approximately 19% in the racially-mixed 64 chromosomes
screened by resequencing. A silent mutation C to T was identified at position 579 of exon
7 that occurred at a frequency of approximately 3% in the racially-mixed 64 chromosomes
screened by resequencing. A G to C change was identified at position -9 of intron 4 which
occurred at a frequency of approximately 1.5% in the racially-mixed 64 chromosomes
screened by resequencing. A G to A change was identified at position +12 of intron 10
which occurred at a frequency of approximately 14% in the racially-mixed 64 chromosomes
screened by resequencing. A C to T change was identified at position -11 of intron 11 which
occurred at a frequency of approximately 12% in the racially-mixed 64 chromosomes
screened by resequencing. A dinucleotide microsatellite sequence, (CA)16, was identified
approximately 500 bp into intron 3 in a single person. Table 3 contains a summary of all the
polymorphisms identified.

A 664 bp fragment of the 5' region of the CYP3A4 gene was sequenced in 64
chromosomes representative of three major ethnic groups. The 470 bp of the promoter
region amplified contains the TATA and CAAT boxes and the octamer motif, as well as
major regulatory elements such as the basic-transcription element, the NFSE, the p53
binding motif, the AP-3 binding site, a progesterone-glucocorticoid and two estrogen
response elements, and a hepatic nuclear factor-5 response element. The polymorphism
at position -392 lies in the 7th position of the 10 bp NFSE. Evidence from previous studies
suggests that the NFSE is part of the regulatory region for CYP3A4 transcription (Hashimoto
et al. (1993) Eur J Biochem 218:585-595).

The A to G nucleotide change observed in the CYP3A4 NFSE at position 7
produces the sequence found in the CYP3A5 NFSE at position 7. Because the NFSE may
partially account for differential expression of CYP3A4 and CYP3A5, this change in CYP3A4
could alter levels of expression and/or tissue specificity, perhaps making it more similar to
the expression pattern of CYP3A5.

Allelic frequencies for the -392 polymorphism vary dramatically among the three
populations tested. Several hypotheses may explain this phenomenon. This result may be
due to genetic drift in the Caucasian and Hispanic populations that has severely restricted
transmission, by chance alone, of the most frequent allele in African Americans. A shift in
frequency of this magnitude seems unlikely for a locus in large human populations to
experience simply by chance. Alternatively, a founder effect could account for this result,
but this is also extremely unlikely for large, outbred populations collected without phenotypic
selection or ascertainment bias. Another possibility is that natural selection has acted upon
this locus, to perhaps restrict the G allele in modern Caucasian and Hispanic populations
that originally arose from an African founder population (Cavalli-Sforza et al. The History and
Geography of Human Genes. Princeton: Princeton University Press, 1994) in which the G
allele was very common. Alternatively, the G allele may provide a selective advantage in
the African environment, so that it has been maintained at a high frequency in the African
American population that has only recently migrated from Africa. This hypothesis directly
implies that this polymorphism affects CYP3A4 expression, and may be important in
modulating metabolism of xenobiotic and pharmaceutical agents.

The (CA)n repeat in intron 3 is very useful, as repeats of this type are usually
highly polymorphic in human populations with many alleles represented. This polymorphism
is therefore useful in genetic transmission studies and provides a genetic "handle" for larger
numbers of CYP3A4 gene haplotypes. The alterations identified at positions -9 and -11 of
introns 4 and 11 respectively may vary the efficiency of mRNA post-transcriptional
processing because of their proximity to the intron/exon boundaries.

Table 1. PCR primers and Mg++ concentrations.

SEQ   Region      Primer                   [Mg++]
14    Promoter    TGAGGAGTTTGGTGAGG        2 mM
15                CAAGAAACAGAGAAGAGG
16    Exon 5      CCCACACAAATACATCC        2 mM
17                AGAAGAGATGGCTTTCC
18    Exon 6      TGTCACTTACTGCTCCA        1 mM
19                CAACAGGAAACCCACA
20    Exon 7      TCCACAATCAATACATGC       2.5 mM
21                CCTGAAGCCAGCAGA
22    Exon 12     CATCTCAACAAGACTGAAAG     1.1 mM
23                TGAACTCCAGAACTGAAG
24    Intron 3    GGCTTTTGTATGTTTGAC       1 mM
25                CGGTTTGTGAAGACAG
26    Intron 10   CCTTGGGGAAAACTGGAT       1.5 mM
27                CTCCTGGGAAGTGGTG


Table 2. Sequencing primers.

SEQ   Region         Forward Primer
28    Promoter (1)   TGAGGAGTTTGGTGAGG
29                   CAAGAAACAGAGAAGAGG
30    Promoter (2)   GTGAGTGGTGTGTGTGTG
31                   GTGATTCAGTGAGGCTGT
32    Exon 5         GGGATAAATCTCTATTGAGCA
33                   GCTTTCCTCAGCATGGA
34    Exon 6         TGTCACTTACTGCTCCA
35                   CACAGGGGAGAAGATCC
36    Exon 7         TGTCTGTCTGGACTGGAC
37                   AAAATGATGATGGTCACAC
38    Exon 12        TAGTGTCAGGAGAGTAGAAAG
39                   GCCTAATTGATTCTTTGG
40    Exon 10        ATTTGCCTTATTCTGGTT
41                   CTCCTGGGAAGTGGTG
42    Intron 3       GGCTTTTGTATGTTTGAC

Table 3. CYP3A4 gene polymorphisms.

SEQ   Location    Polymorphism   Position   SEQ ID NO
      Promoter    A to G         -392       SEQ ID NO:1, nt 816
43      ACAAGGGCAAGAGAGAGGC
44      ACAAGGGCAGGAGAGAGGC

      Intron 3    CA repeat      +506       SEQ ID NO:2, nt 560-591
45      GGGTTTTTA
46      GGGTTTTTACACACACACACACACACACACACACACACACA

      Intron 4    G to C         -9         SEQ ID NO:3, nt 114
47      TTCTGCTTTGAACTCTAGC
48      TTCTGCTTTCAACTCTAGC

      Intron 6    T to G         +52        SEQ ID NO:4, nt 183
49      CCCTCCAGCTGCCTGCCAT
50      CCCTCCAGCGGCCTGCCAT

      Exon 7      C to T         579        SEQ ID NO:5, nt 88
51      AGTGAACATCGACTCTCTC
52      AGTGAACATTGACTCTCTC

      Intron 7    T to G         +34        SEQ ID NO:5, nt 213
53      ATTTATCTTTCTCTCTTAA
54      ATTTATCTTGCTCTCTTAA

      Intron 10   G to A         +12        SEQ ID NO:6, nt 293
55      GAGTGGATGGTACATGGAG
56      GAGTGGATGATACATGGAG

      Intron 11   C to T         -11        SEQ ID NO:7, nt 235
57      TCTACCAACGTGGAACCA
58      TCTACCAATGTGGAACCA
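
Each substitution in Table 3 is given as a pair of short flanking sequences, so the polymorphic base can be recovered by direct comparison of the two members of a pair. The sketch below does this for the seven single-base changes; the (CA) repeat in intron 3 is an insertion and is left out. The function name and dictionary keys are hypothetical, and the intron 4 and intron 7 sequences are an assumed reading of entries that are partly garbled in the scanned table.

```python
# Allele pairs as read from Table 3: (reference, variant).
# Intron 4 and intron 7 entries are an assumed reading of garbled scan text.
SNP_PAIRS = {
    "promoter -392": ("ACAAGGGCAAGAGAGAGGC", "ACAAGGGCAGGAGAGAGGC"),
    "intron 4 -9": ("TTCTGCTTTGAACTCTAGC", "TTCTGCTTTCAACTCTAGC"),
    "intron 6 +52": ("CCCTCCAGCTGCCTGCCAT", "CCCTCCAGCGGCCTGCCAT"),
    "exon 7 579": ("AGTGAACATCGACTCTCTC", "AGTGAACATTGACTCTCTC"),
    "intron 7 +34": ("ATTTATCTTTCTCTCTTAA", "ATTTATCTTGCTCTCTTAA"),
    "intron 10 +12": ("GAGTGGATGGTACATGGAG", "GAGTGGATGATACATGGAG"),
    "intron 11 -11": ("TCTACCAACGTGGAACCA", "TCTACCAATGTGGAACCA"),
}


def single_base_change(reference, variant):
    """Return (index, ref_base, var_base) for a pair differing at exactly one base."""
    if len(reference) != len(variant):
        raise ValueError("sequences differ in length; not a simple substitution")
    diffs = [(i, r, v) for i, (r, v) in enumerate(zip(reference, variant)) if r != v]
    if len(diffs) != 1:
        raise ValueError(f"expected exactly one difference, found {len(diffs)}")
    return diffs[0]


changes = {name: single_base_change(*pair) for name, pair in SNP_PAIRS.items()}
for name, (index, ref_base, var_base) in changes.items():
    print(f"{name}: index {index} (0-based), {ref_base} to {var_base}")
```

A check of this kind is also a convenient way to confirm that each pair of table entries agrees with the base change stated in its row.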







All publications and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference. The citation of any
publication is for its disclosure prior to the filing date and should not be construed as an
admission that the present invention is not entitled to antedate such publication by virtue of
prior invention.

Although the foregoing invention has been described in some detail by way of
illustration and example for purposes of clarity of understanding, it will be readily
apparent to those of ordinary skill in the art in light of the teachings of this invention that
certain changes and modifications may be made thereto without departing from the spirit
or scope of the appended claims.

What is Claimed is:

1. An isolated nucleic acid molecule comprising a CYP3A4 sequence
polymorphism, as part of other than a naturally occurring chromosome.

2. The isolated nucleic acid of Claim 1, wherein said nucleic acid comprises the
nucleotide sequence as set forth in SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ
ID NO:50, SEQ ID NO:52, SEQ ID NO:54, SEQ ID NO:56, or SEQ ID NO:58.

3. The isolated nucleic acid of Claim 1, wherein said nucleic acid is a
hybridization probe of at least 18 nucleotides in length.

4. The isolated nucleic acid of Claim 3, wherein said probe is conjugated to a
detectable marker.

5. An array of oligonucleotides comprising:
at least one probe of Claim 3 for detection of CYP3A4 locus polymorphisms.

6. A method for detecting in an individual a polymorphism in CYP3A4
metabolism of a substrate, the method comprising:
analyzing the genome of said individual for the presence of at least one CYP3A4
polymorphism listed in Table 3; wherein the presence of said predisposing polymorphism
is indicative of an alteration in CYP3A4 expression or activity.

7. The method of Claim 6, wherein said analyzing step comprises detection of
specific binding between the genomic DNA of said individual with a probe according to Claim
3.

8. The method of Claim 6, wherein said analyzing step comprises detection of
specific binding between the genomic DNA of said individual with an array according to
Claim 5.

9. The method of Claim 6, wherein said alteration in CYP3A4 expression is
tissue specific.

10. The method of Claim 6, wherein said alteration in CYP3A4 expression is in
response to a CYP3A4 modifier.

11. The method of Claim 10, wherein said modifier induces CYP3A4 expression.

12. The method of Claim 10, wherein said modifier inhibits CYP3A4 expression.

SEQUENCE LISTING



<110> Lichter, Jay 
Guido, Marco

<120> GENOTYPING OF HUMAN CYP3A4 



<130> SEQ-12P 

<150> 60/058,612 
<151> 1997-09-10 

<160> 58 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2759 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (70)...(1581)

<223> Human CYP3A4 cDNA reference sequence 
<400> 1 

gaattcccaa agagcaacac agagctgaaa ggaagactca gaggagagag ataagcaagg 60
aaagtagtg atg gct ctc atc cca gac ttg gcc atg gaa acc tgg ctt ctc 111
Met Ala Leu Ile Pro Asp Leu Ala Met Glu Thr Trp Leu Leu
1 5 10

ctg gct gtc agc ctg gtg ctc ctc tat cta tat gga acc cat tca cat 159
Leu Ala Val Ser Leu Val Leu Leu Tyr Leu Tyr Gly Thr His Ser His
15 20 25 30

gga ctt ttt aag aag ctt gga att cca ggg ccc aca ccc ctg cct ttt 207
Gly Leu Phe Lys Lys Leu Gly Ile Pro Gly Pro Thr Pro Leu Pro Phe
35 40 45

ttg gga aat att ttg tcc tac cat aag ggc ttt tgt atg ttt gac atg 255
Leu Gly Asn Ile Leu Ser Tyr His Lys Gly Phe Cys Met Phe Asp Met
50 55 60

gaa tgt cat aaa aag tat gga aaa gtg tgg ggc ttt tat gat ggt caa 303
Glu Cys His Lys Lys Tyr Gly Lys Val Trp Gly Phe Tyr Asp Gly Gln
65 70 75

cag cct gtg ctg gct atc aca gat cct gac atg atc aaa aca gtg cta 351
Gln Pro Val Leu Ala Ile Thr Asp Pro Asp Met Ile Lys Thr Val Leu
80 85 90

gtg aaa gaa tgt tat tct gtc ttc aca aac cgg agg cct ttt ggt cca 399
Val Lys Glu Cys Tyr Ser Val Phe Thr Asn Arg Arg Pro Phe Gly Pro
95 100 105 110

gtg gga ttt atg aaa agt gcc atc tct ata gct gag gat gaa gaa tgg 447
Val Gly Phe Met Lys Ser Ala Ile Ser Ile Ala Glu Asp Glu Glu Trp
115 120 125

aag aga tta cga tca ttg ctg tct cca acc ttc acc agt gga aaa ctc 495
Lys Arg Leu Arg Ser Leu Leu Ser Pro Thr Phe Thr Ser Gly Lys Leu
130 135 140

aag gag atg gtc cct atc att gcc cag tat gga gac gtg ttg gtg aga 543
Lys Glu Met Val Pro Ile Ile Ala Gln Tyr Gly Asp Val Leu Val Arg
145 150 155

aat ctg agg cgg gaa gca gag aca ggc aag cct gtc acc ttg aaa gac 591
Asn Leu Arg Arg Glu Ala Glu Thr Gly Lys Pro Val Thr Leu Lys Asp
160 165 170

gtc ttt ggg gcc tac agc atg gat gtg atc acc agc aca tca ttt gga 639
Val Phe Gly Ala Tyr Ser Met Asp Val Ile Thr Ser Thr Ser Phe Gly
175 180 185 190

gtg aac atc gac tct ctc aac aat cca caa gac ccc ttt gtg gaa aac 687
Val Asn Ile Asp Ser Leu Asn Asn Pro Gln Asp Pro Phe Val Glu Asn
195 200 205

acc aag aag ctt tta aga ttt gat ttt ttg gat cca ttc ttt ctc tca 735
Thr Lys Lys Leu Leu Arg Phe Asp Phe Leu Asp Pro Phe Phe Leu Ser
210 215 220

ata aca gtc ttt cca ttc ctc atc cca att ctt gaa gta tta aat atc 783
Ile Thr Val Phe Pro Phe Leu Ile Pro Ile Leu Glu Val Leu Asn Ile
225 230 235

tgt gtg ttt cca aga gaa gtt aca aat ttt tta aga aaa tct gta aaa 831
Cys Val Phe Pro Arg Glu Val Thr Asn Phe Leu Arg Lys Ser Val Lys
240 245 250

agg atg aaa gaa agt cgc ctc gaa gat aca caa aag cac cga gtg gat 879
Arg Met Lys Glu Ser Arg Leu Glu Asp Thr Gln Lys His Arg Val Asp
255 260 265 270

ttc ctt cag ctg atg att gac tct cag aat tca aaa gaa act gag tcc 927
Phe Leu Gln Leu Met Ile Asp Ser Gln Asn Ser Lys Glu Thr Glu Ser
275 280 285

cac aaa gct ctg tcc gat ctg gag ctc gtg gcc caa tca att atc ttt 975
His Lys Ala Leu Ser Asp Leu Glu Leu Val Ala Gln Ser Ile Ile Phe
290 295 300

att ttt gct ggc tat gaa acc acg agc agt gtt ctc tcc ttc att atg 1023
Ile Phe Ala Gly Tyr Glu Thr Thr Ser Ser Val Leu Ser Phe Ile Met
305 310 315

tat gaa ctg gcc act cac cct gat gtc cag cag aaa ctg cag gag gaa 1071
Tyr Glu Leu Ala Thr His Pro Asp Val Gln Gln Lys Leu Gln Glu Glu
320 325 330

att gat gca gtt tta ccc aat aag gca cca ccc acc tat gat act gtg 1119
Ile Asp Ala Val Leu Pro Asn Lys Ala Pro Pro Thr Tyr Asp Thr Val
335 340 345 350

cta cag atg gag tat ctt gac atg gtg gtg aat gaa acg ctc aga tta 1167
Leu Gln Met Glu Tyr Leu Asp Met Val Val Asn Glu Thr Leu Arg Leu
355 360 365

ttc cca att gct atg aga ctt gag agg gtc tgc aaa aaa gat gtt gag 1215
Phe Pro Ile Ala Met Arg Leu Glu Arg Val Cys Lys Lys Asp Val Glu
370 375 380

atc aat ggg atg ttc att ccc aaa ggg tgg gtg gtg atg att cca agc 1263
Ile Asn Gly Met Phe Ile Pro Lys Gly Trp Val Val Met Ile Pro Ser
385 390 395

tat gct ctc cac cgt gac cca aag tac tgg aca gag cct gag aag ttc 1311
Tyr Ala Leu His Arg Asp Pro Lys Tyr Trp Thr Glu Pro Glu Lys Phe
400 405 410

ctc cct gaa aga ttc agc aag aag aac aag gac aac ata gat cct tac 1359
Leu Pro Glu Arg Phe Ser Lys Lys Asn Lys Asp Asn Ile Asp Pro Tyr
415 420 425 430

ata tac aca ccc ttt gga agt gga ccc aga aac tgc att ggc atg agg 1407
Ile Tyr Thr Pro Phe Gly Ser Gly Pro Arg Asn Cys Ile Gly Met Arg
435 440 445

ttt gct ctc atg aac atg aaa ctt gct cta atc aga gtc ctt cag aac 1455
Phe Ala Leu Met Asn Met Lys Leu Ala Leu Ile Arg Val Leu Gln Asn
450 455 460

ttc tcc ttc aaa cct tgt aaa gaa aca cag atc ccc ctg aaa tta agc 1503
Phe Ser Phe Lys Pro Cys Lys Glu Thr Gln Ile Pro Leu Lys Leu Ser
465 470 475

tta gga gga ctt ctt caa cca gaa aaa ccc gtt gtt cta aag gtt gag 1551
Leu Gly Gly Leu Leu Gln Pro Glu Lys Pro Val Val Leu Lys Val Glu
480 485 490

tca agg gat ggc acc gta agt gga gcc tga actttcctaa ggacttctgc 1601
Ser Arg Asp Gly Thr Val Ser Gly Ala *
495 500

tttgctcttc aagaaatctg tgcctgagaa caccagagac ctcaaattac tttgtgaata 1661
gaactctgaa atgaagatgg gcttcatcca acggaccgca taaacaaccg gggattctgt 1721
acatgcattg agctctctca ttgtctgtgt agagtgttat acttgggaat ataaaggagg 1781
tgaccaaatc agtgtgagga ggtagatttg gctcctctgc ccctcacggg actatttcca 1841
ccacccccag ttagcaccat taactcctcc tgagctctga taagagaatc aacatttctc 1901
aataatttcc tccacaaatt attaatgaaa ataagaatta ttttgatggc tctaacaatg 1961
acatttatat cacatgtttt ctctggagta ttctatagtt ttatgttaaa tcaataaaga 2021
ccactttaca aaagtattac cagatgcttt cctgcacatt aaggagaatc catagaactg 2081
aatgagaacc aacaagtaaa tatttttggt cattgtaatc actgttggcg tggggccttt 2141
gtcagaacta gaatttgatt atcaacatag gtgaaagtta atccactgtg actttgccca 2201
ttgtttagaa agaatattca tagtttaatt atgccttttt tgatcaggca catggctcac 2261
gcctgtaatc ctagcagttt gggaggctga gccgggtgga tcgcctgagg tcaggagttc 2321
aagacaagcc tggcctacat ggtgaaaccc catctctact aaaaatacac aaattagcta 2381
ggcatggtgg actcgcctgt aatctcacta cacaggaggc tgaggcagga gaatcacttg 2441
aacctgggag gcggatgttg aagtgagctg agattgcacc accgcactcc agtctgggtg 2501
agagtgagac tcagtcttaa aaaaatatgc ctttttgaag cacgtacatt ttgtaacaaa 2561
gaactgaagc tcttattata ttattagttt tgatttaatg ttttcagccc atcccctttc 2621
atatttctgg gagacagaaa acatgtttcc ctacacctct tgcttccatc ctcaacaccc 2681
aactgtctcg atgcaatgaa cacttaataa aaaacagtcg atcggtcaaa aaaaaaaaaa 2741
aaaaaaaaaa aagaattc 2759

<210> 2 
<211> 503 
<212> PRT 
<213> H. sapiens 



<400> 2 

Met Ala Leu Ile Pro Asp Leu Ala Met Glu Thr Trp Leu Leu Leu Ala
1               5                   10                  15

Val Ser Leu Val Leu Leu Tyr Leu Tyr Gly Thr His Ser His Gly Leu
            20                  25                  30

Phe Lys Lys Leu Gly Ile Pro Gly Pro Thr Pro Leu Pro Phe Leu Gly
        35                  40                  45

Asn Ile Leu Ser Tyr His Lys Gly Phe Cys Met Phe Asp Met Glu Cys
    50                  55                  60

His Lys Lys Tyr Gly Lys Val Trp Gly Phe Tyr Asp Gly Gln Gln Pro
65                  70                  75                  80

Val Leu Ala Ile Thr Asp Pro Asp Met Ile Lys Thr Val Leu Val Lys
                85                  90                  95

Glu Cys Tyr Ser Val Phe Thr Asn Arg Arg Pro Phe Gly Pro Val Gly
            100                 105                 110

Phe Met Lys Ser Ala Ile Ser Ile Ala Glu Asp Glu Glu Trp Lys Arg
        115                 120                 125

Leu Arg Ser Leu Leu Ser Pro Thr Phe Thr Ser Gly Lys Leu Lys Glu
    130                 135                 140

Met Val Pro Ile Ile Ala Gln Tyr Gly Asp Val Leu Val Arg Asn Leu
145                 150                 155                 160

Arg Arg Glu Ala Glu Thr Gly Lys Pro Val Thr Leu Lys Asp Val Phe
                165                 170                 175

Gly Ala Tyr Ser Met Asp Val Ile Thr Ser Thr Ser Phe Gly Val Asn
            180                 185                 190

Ile Asp Ser Leu Asn Asn Pro Gln Asp Pro Phe Val Glu Asn Thr Lys
        195                 200                 205

Lys Leu Leu Arg Phe Asp Phe Leu Asp Pro Phe Phe Leu Ser Ile Thr
    210                 215                 220

Val Phe Pro Phe Leu Ile Pro Ile Leu Glu Val Leu Asn Ile Cys Val
225                 230                 235                 240

Phe Pro Arg Glu Val Thr Asn Phe Leu Arg Lys Ser Val Lys Arg Met
                245                 250                 255

Lys Glu Ser Arg Leu Glu Asp Thr Gln Lys His Arg Val Asp Phe Leu
            260                 265                 270

Gln Leu Met Ile Asp Ser Gln Asn Ser Lys Glu Thr Glu Ser His Lys
        275                 280                 285

Ala Leu Ser Asp Leu Glu Leu Val Ala Gln Ser Ile Ile Phe Ile Phe
    290                 295                 300

Ala Gly Tyr Glu Thr Thr Ser Ser Val Leu Ser Phe Ile Met Tyr Glu
305                 310                 315                 320

Leu Ala Thr His Pro Asp Val Gln Gln Lys Leu Gln Glu Glu Ile Asp
                325                 330                 335

Ala Val Leu Pro Asn Lys Ala Pro Pro Thr Tyr Asp Thr Val Leu Gln
            340                 345                 350

Met Glu Tyr Leu Asp Met Val Val Asn Glu Thr Leu Arg Leu Phe Pro
        355                 360                 365

Ile Ala Met Arg Leu Glu Arg Val Cys Lys Lys Asp Val Glu Ile Asn
    370                 375                 380

Gly Met Phe Ile Pro Lys Gly Trp Val Val Met Ile Pro Ser Tyr Ala
385                 390                 395                 400

Leu His Arg Asp Pro Lys Tyr Trp Thr Glu Pro Glu Lys Phe Leu Pro
                405                 410                 415

Glu Arg Phe Ser Lys Lys Asn Lys Asp Asn Ile Asp Pro Tyr Ile Tyr
            420                 425                 430

Thr Pro Phe Gly Ser Gly Pro Arg Asn Cys Ile Gly Met Arg Phe Ala
        435                 440                 445

Leu Met Asn Met Lys Leu Ala Leu Ile Arg Val Leu Gln Asn Phe Ser
    450                 455                 460

Phe Lys Pro Cys Lys Glu Thr Gln Ile Pro Leu Lys Leu Ser Leu Gly
465                 470                 475                 480

Gly Leu Leu Gln Pro Glu Lys Pro Val Val Leu Lys Val Glu Ser Arg
                485                 490                 495

Asp Gly Thr Val Ser Gly Ala
            500



<210> 3 
<211> 1345 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 
<222> (0)...(0)

<400> 3 
ctgcagtgac cactgcccca tcattgctgg ctgaggtggt tggggcccan ccggccatct 60

gggcagctgt tctcttctct cctttctctc ctgtttccag acatgcagta tttccagaga 120

gaaggggcca ctctttggca aagaacctgt ccaacttgct atctatggca ggacctttga 180

agggttcaca ggaagcagca caaattgata ctattccacc aagccatcag ctccatctca 240

tccacgccct gtctctcctt taggggtccc cttgccaaca gaatcacaga ggaccagcct 300

gaaagtgcag agacagcagc tgaggcacag ccaagagctc tggctgtatt aatgacctaa 360

gaagtcacca gaaagtcaga aggatgcata gcagaggccc agcaatctca gctaagtcaa 420

ctccaccagc ctttctagtt gcccactgtg tgtacagcac cctggtaggg accagagcca 480 

tgacagggaa taagactaga ctatgccctt gaggagctca cctctgttca gggaaacagg 540 

cgtggaaaca caatggtggt aaagaggaaa gaggacaata ggattgcatg aaggggatgg 600 

aaagtgccca ggggaggaaa cggttacatc tgtgtgagga gtttggtgag gaaagactct 660 

aagagaaggc tctgtctgtc tgggtttgga aggatgtgta ggagtcttct agggggcaca 720 

ggcacactcc aggcataggt aaagatctgc aggtgtggct tgttgggatg aatttcaagt 780 

attttggaat gaggacagcc atagagacaa gggcargaga gaggcgactt aatagatttt 840 

atgccaatgg ctccacttga gtttctgata agaacccaga acccttggac tccccagtaa 900 

cattgattga gttgtttatg acacctcata gaatatgaac ccaaaggagg tcagcgagtg 960 

gtgtgtgtgt gattctttgc caacttccaa ggtggagaag cctctcccaa ctgcaggcag 1020 

agcacaggtg gccctgctac cggctgcagc tccagccctg cctccttctc tagcatataa 1080 

acaatccaac agcctcactg aatcactgct gtgcagggca ggaaagctcc atgcacatag 1140 

cccagcaaag agcaacacag agccgaaagg aagactcaga ggagagagat aagtaaggaa 1200 

agtagtgatg gctctcatcc cagacttggc catggaaacc tggcttctcc tggctgtcag 1260 

cccggtgctc ctctatctgt gagtaactgt tcaggctcct cttctctgtt tcttggactt 1320 

ggggtcgtaa tcaggcctct ctttt 1345 



<210> 4 
<211> 591 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 
<222> (0)...(0)



<400> 4 

ggctcttgta cgcttgacat ggaatgtcat aaaaagtatg gaaaagtgtg ggggtgagta 60 

ttctggaaac ttccattgga tagacttgtt tctatgatga gtttacccca ctgcacagag 120 

gacagtctca gcccaaagcc tcttgggatg aagctcttgt caacctaact acaaacagag 180 

agaagttctc tgaaagaaga agatatttac ttgggtgtag agtattgcaa tgggaatctg 240 

catgccttta taaactatgt gcaaattcag ggaagtaaag caagacaaag aggctccaag 300 

gaaaatacga aggaggattt cttatcagtt ttgaaacaat tatccctcgc tacaaagatc 360 

agtaacaagg gtgacgcctc accaaggttg gacaggcagt tgctgggcag gtgtccttgc 420 

agaaatactt ttttaatgtc gggatggccc ttgtgcaagc tcgtacttcg cggagtcttt 480 

gtgatatttt gttatcaggc acacaagcat gagaatcctc tcttcatagc cttccttgat 540 

ttatttgtca gggtttttac acacacacac acacacacac acacacacac a 591 



<210> 5 
<211> 433 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 
<222> (0)...(0)



<400> 5 

catcacccag tagacagtca ctaaatagtt gttgaataag tgttcctgtt taacacattt 60 

tctacaacca tggagacctc cacaactgat gtaggacaaa atgtttctgc tttsaactct 120 

agccttttgg tccagtggga tttatgaaaa gtgccatctc tatagctgag gatgaagaat 180 

ggaagagatt acgatcattg ctgtctccaa ccttcaccag tggaaaactc aaggaggtat 240 

gaaaataaca tgagttttaa taagaaactt aaagaatgaa tctggtgggg acaggtataa 300 

aataagatca cagtcccttt ccaaggggta gtccactgaa tttgagctgc ctaaaaatgg 360 

tcttttatct ttatgtacag aaaacacatc acaaaattca ttataaaatg tcacttactg 420 

ctccatgctg ggg 433 



<210> 6 
<211> 408
<212> DNA
<213> H. sapiens 

<220> 

<221> Other 
<222> (0)...(0)

<400> 6 

tctgcacatt taaccatggg tggtgttgtg ttttgtgctt agatggtccc tatcactgcc 60

cagtatggag atgtgttggt gagaaatctg aggcgggaag cagagacagg caagcctgtc 120 

accttgaaag agtaagtaga agcgcagcca tggggttctg agctgccatg aacccctcca 180 

gckgcctgcc atggagctga tattcctgct gttgggttat tccagtgacc agacaaaagg 240 

agggctgtgg taatgcaact tcaatgggtc tcccaagatg gggcagctcc gatgaggagg 300

tggggcagct ggaggaaaag gatcttctcc cctgtgcaca ggggccaggg tttacatatc 360

cattaaattg tcaccttgga tactctagaa gactaaatat atcctcta 408 

<210> 7 
<211> 429 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 
<222> (0)...(0)

<400> 7 

ttttaatttt ccacatcttt ctccactcag cgtctttggg gcctacagca tggatgtgat 60 

cactagcaca tcatttggag tgaacatyga ctctctcaac aatccacaag acccctttgt 120 

ggaaaacacc aagaagcttt taagatttga ttttttggat ccattctttc tctcaataag 180 

tatgtggact actatttcct tttatttatc ttkctctctt aaaaataact gctttattga 240 

gatataaatc accatgtaat tcatccactt aaaatataca gttcagtgat ttgtagtaca 300 

tttgaagata tgtgtgacca tcatcatttc aaactttaaa actttttttg tcaatctaga 360 

gacctcatac atttttagct atcagccccc tgtcacaaac cctgtcatca tatgcaacca 420 

ctaatcaac 429 

<210> 8 
<211> 352 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 
<222> (0)...(0)

<400> 8 

aattgctttt ctattctttt cccttaggga tttgagggct tcacttagat ttctcttcat 60 

ctaaactgtg atgccctaca ttgatctgat ttacctaaaa tgtctttcct ctcctttcag 120 

ctctgtccga tctggagctc gtggcccaat caattatctt tatttttgct ggctatgaaa 180 

ccacgagcag tgttctctcc ttcattatgt atgaactggc cactcaccct gatgtccagc 240 

agaaactgca ggaggaaatt gatgcagttt tacccaataa ggtgagtgga tgrtacatgg 300 

agaaggaggg aggaggtgaa accttagcaa aaatgcctcc tcaccacttc cc 352 

<210> 9 
<211> 309 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 
<222> (0)...(0)

<400> 9 

gcatagcagg atttcaatga ccagcccaca aaagtatcct gtgtactacc agttgagggg 60 
tggcccctaa gtaagaaacc ctaacatgta actcttaggg gtattatgtc attaactttt 120 
taaaaatcta ccaaygtgga accagattca gcaagaagaa caaggacaac atagatcctt 180

acatatacac accctttgga agtggaccca gaaactgcat tggcatgagg tttgctctca 240 

tgaacatgaa acttgctcta atcagagtcc ttcagaactt ctccttcaaa ccttgtaaag 300 

aaacacagg 309 

<210> 10 

<211> 24 

<212> DNA 

<213> H. sapiens 

<400> 10 

tggcttgttg ggatgaattt caag 24 

<210> 11 
<211> 24 
<212> DNA 
<213> H. sapiens 

<400> 11 

ctactgggga gtccaagggt tctg 24 

<210> 12 

<211> 32 

<212> DNA 

<213> H. sapiens 

<400> 12 

ttaaatcgcc tctctcttgc ccttgtctct at 32 

<210> 13 

<211> 29 

<212> DNA 

<213> H. sapiens 

<400> 13 

aatcgcctct ctcctgccct tgtctctat 29 

<210> 14 

<211> 17 

<212> DNA 

<213> H. sapiens 

<400> 14 

tgaggagttt ggtgagg 17 

<210> 15 

<211> 18 

<212> DNA 

<213> H. sapiens 

<400> 15 

caagaaacag agaagagg 18

<210> 16 

<211> 17 

<212> DNA 

<213> H. sapiens 

<400> 16 

cccacacaaa tacatcc 17 

<210> 17 
<211> 17 
<212> DNA 
<213> H. sapiens 

<400> 17

agaagacatg gctttcc 17 

<210> 18 
<211> 17 
<212> DNA 
<213> H. sapiens 

<400> 18 

tgtcactcac tgctcca 17 

<210> 19 
<211> 16 
<212> DNA 
<213> H. sapiens 

<400> 19 

caacaggaaa cccaca 16 

<210> 20 
<211> 18 
<212> DNA 
<213> H. sapiens 

<400> 20 

tccacaatca atacatgc 18 

<210> 21 

<211> 15 

<212> DNA 

<213> H. sapiens 

<400> 21 

cctgaagcca gcaga 15

<210> 22 

<211> 20 

<212> DNA 

<213> H. sapiens 

<400> 22 

catctcaaca agactgaaag 20 

<210> 23 
<211> 18 
<212> DNA 
<213> H. sapiens 

<400> 23 

tgaactccag aactgaag 18 

<210> 24 
<211> 18 
<212> DNA 
<213> H. sapiens 

<400> 24 

ggcttttgta tgtttgac 18 

<210> 25 
<211> 16 
<212> DNA 
<213> H. sapiens 

<400> 25 

cggtttgtga agacag 16

<210> 26 
<211> 18 
<212> DNA 
<213> H. sapiens 

<400> 26 
cctcggggaa aactggat 18

<210> 27

<211> 16

<212> DNA

<213> H. sapiens

<400> 27
ctcctgggaa gtggtg 16

<210> 28 

<211> 17 

<212> DNA 

<213> H. sapiens 

<400> 28 
tgaggagttt ggtgagg 17

<210> 29 
<211> 18
<212> DNA
<213> H. sapiens

<400> 29
caagaaacag agaagagg 18

<210> 30 

<211> 18 

<212> DNA 

<213> H. sapiens

<400> 30
gtgagtggtg tgtgtgtg 18

<210> 31 

<211> 18 

<212> DNA 

<213> H. sapiens 

<400> 31 
gtgattcagt gaggctgt 18

<210> 32 

<211> 21 

<212> DNA 

<213> H. sapiens 

<400> 32 
gggataaatc tctattgagc a 21

<210> 33 

<211> 17 

<212> DNA 

<213> H. sapiens 

<400> 33 
gctttcctca gcatgga 17

<210> 34

<211> 17 

<212> DNA 

<213> H. sapiens 

<400> 34 
tgtcacttac tgctcca 17

<210> 35 

<211> 17 

<212> DNA 

<213> H. sapiens 

<400> 35 
cacaggggag aagatcc 17

<210> 36 

<211> 18 

<212> DNA 

<213> H. sapiens 

<400> 36 
tgtctgtctg gactggac 18

<210> 37 

<211> 19 

<212> DNA 

<213> H. sapiens 

<400> 37 
aaaatgatga tggtcacac 19

<210> 38 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 38 
tagtgtcagg agagtagaaa g 21

<210> 39 

<211> 18 

<212> DNA 

<213> H. sapiens 

<400> 39 
gcctaattga ttctttgg 18

<210> 40 

<211> 18 

<212> DNA 

<213> H. sapiens 

<400> 40 
atttgcctta ttctggtt 18

<210> 41 
<211> 16 
<212> DNA 
<213> H. sapiens 

<400> 41 
ctcctgggaa gtggcg 16

<210> 42

<211> 18

<212> DNA

<213> H. sapiens

<400> 42
ggcttttgta tgtttgac 18



<210> 43 

<211> 19 

<212> DNA 

<213> H. sapiens 

<400> 43 

acaagggcaa gagagaggc 19 

<210> 44 

<211> 19 

<212> DNA 

<213> H. sapiens 

<400> 44 

acaagggcag gagagaggc 19

<210> 45

<211> 9

<212> DNA

<213> H. sapiens

<400> 45

gggttttta 9

<210> 46

<211> 41

<212> DNA

<213> H. sapiens

<400> 46

gggtttttac acacacacac acacacacac acacacacac a 41

<210> 47

<211> 19

<212> DNA

<213> H. sapiens

<400> 47

ttctgctttg aactctagc 19

<210> 48
<211> 19
<212> DNA
<213> H. sapiens

<400> 48

ttctgctttc aactctagc 19

<210> 49

<211> 19

<212> DNA

<213> H. sapiens

<400> 49
ccctccagct gcctgccat 19



<210> 50 
<211> 19 
<212> DNA
<213> H. sapiens

<400> 50

ccctccagcg gcctgccat 19



<210> 51 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 51 

agtgaacatc gactctctc 19 

<210> 52 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 52 

agtgaacatt gactctctc 19 

<210> 53 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 53 

atttatcttt ctctcctaa 19 

<210> 54 
<211> 19 
<212> DNA 
<213> H. sapiens

<400> 54 

atttatcttg ctctcttaa 19 

<210> 55
<211> 19
<212> DNA
<213> H. sapiens

<400> 55

gagtggatga tacatggag 19

<210> 56
<211> 19
<212> DNA
<213> H. sapiens

<400> 56

gagtggatgg tacatggag 19

<210> 57
<211> 18
<212> DNA
<213> H. sapiens

<400> 57

tctaccaacg tggaacca 18



<210> 58 
<211> 18 
<212> DNA 



WO 99/13106



<213> H. sapiens 

<400> 58 
tctaccaatg tggaacca 18
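
The genomic fragments in SEQ ID NOS: 3 to 9 each carry an ambiguity code (n, r, s, k or y), and the 19-mer pairs in SEQ ID NOS: 43 to 58 appear to correspond to the two resolved forms of those positions. The following is a minimal sketch, not part of the original listing, of how such a position can be expanded. It assumes the standard IUPAC nucleotide ambiguity codes; the function name `expand` and the variable names are illustrative only.

```python
# Standard IUPAC nucleotide ambiguity codes, lowercase as in the listing.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at",
    "k": "gt", "m": "ac", "b": "cgt", "d": "agt",
    "h": "act", "v": "acg", "n": "acgt",
}


def expand(seq):
    """Return every unambiguous sequence encoded by an IUPAC string."""
    results = [""]
    for base in seq:
        # Branch each partial sequence once per base the code allows.
        results = [prefix + b for prefix in results for b in IUPAC[base]]
    return results


# 19-base window copied from SEQ ID NO: 3 (the row ending at position 840),
# with the grouping spaces removed; "r" stands for a or g.
window = "acaagggcargagagaggc"
alleles = expand(window)

# The two oligonucleotides as listed under SEQ ID NOS: 43 and 44.
seq_43 = "acaagggcaagagagaggc"
seq_44 = "acaagggcaggagagaggc"
print(alleles == [seq_43, seq_44])
```

The same check can be repeated for the other pairs by copying the 18- or 19-base window around the ambiguity code from the corresponding genomic fragment.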